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METHOD, SYSTEM, AND PROGRAM FOR 
DATA SYNCHRONIZATION 



BACKGROUND OF THE INVENTION 
5 1. Field of the Invention 

[0001] The present invention is related to improved data synchronization. 



2. Description of the Related Art 

[0002] In typical disaster recovery solutions, data is housed at a primary site as well as at 

10 one or more secondary sites. These secondary sites maintain a synchronized copy of the 
data such that no data is lost in the case of a disaster at the primary site. If a disaster 
occurs, processing is either "failed-over" to one of the secondary sites or the data is 
copied from the secondary site back to the primary site. In order for disaster recovery to 
be effective, the secondary sites are typically far away from the primary site so that both 

1 5 sites are not affected by the same disaster. 

[0003] Disaster recovery systems typically address two types of failures, a sudden 
catastrophic failure at a single point in time or data loss over a period of time. In the 
second type of gradual disaster, updates to volumes may be lost. For either type of 
failure, a copy of data may be available at a remote location. Such dual or shadow 

20 copies are typically made as the application system is writing new data to a primary 
storage device at a primary site. A storage device is a physical unit that provides a 
mechanism to store data on a given medium, such that the data can be subsequently 
retrieved. International Business Machines Corporation (IBM), the assignee of the 
subject patent application, provides systems for maintaining remote copies of data at a 

25 secondary storage device, including extended remote copy (XRC®) and peer-to-peer 
remote copy (PPRC). 

[0004] These systems provide techniques for recovering data updates between a last, safe 
backup and a system failure. Such data shadowing systems can also provide an additional 
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remote copy for non-recovery purposes, such as local access at a remote site. The IBM 
XRC and PPRC systems are described further in z/OS V1R1.0 DFSMS Advanced Copy 
Services (IBM Document Number SC3 5-0428-00), April 2001, which is available from 
International Business Machines Corporation. 
5 [0005] In such backup systems, data is maintained in volume pairs. A volume pair is 
comprised of a volume in a primary storage device and a corresponding volume in a 
secondary storage device that includes a consistent copy of the data maintained in the 
primary volume. Typically, the primary volume of the pair will be maintained in a 
primary storage control unit, and the secondary volume of the pair is maintained in a 
10 secondary storage control unit at a different physical location than the primary storage 
control unit. A storage control unit is a physical hardware unit that consists of a storage 
server integrated with one or more storage devices to provide storage capability to a host 
computer. A storage server is a physical unit that provides an interface between one or 
more storage devices and a host computer by providing the function of one or more 
15 logical subsystems. The storage server may provide functions that are not provided by 
the storage device. The storage server is composed of one or more clusters of storage 
devices. A primary storage control unit may be provided to control access to the primary 
DASD and a secondary storage control unit may be provided to control access to the 
secondary DASD. 

20 [0006] It is important that all secondary data sites are synchronized and contain an exact 
copy of the primary site's data. Sometimes, however, errors occur that cause the system 
to not know whether the primary and secondary sites are synchronized. In typical 
disaster recovery solutions, if a secondary site loses certainty of synchronization with the 
primary site, all of the data must be copied from the primary site to the secondary site. 

25 For large systems that are typical for large corporations, the time required to 

resynchronize the two sites is enormous due to the tremendous amount of data that must 
now be copied. Besides the time it takes to carry out this copy, it must also be 
remembered that as a result of this recopy, the data link between the two sites is being 
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used much more heavily than is typical. This also causes normal processing that 
continues to be impacted since the bandwidth necessary to continue may no longer be 
available. 

[0007] In particular, in prior art systems, when two volumes lose synchronization for any 
5 reason, it is necessary for the primary site to send the entire volume of data to the 
secondary site. If many volumes are affected and/or the volumes are very large, this 
could take a considerable amount of time. Not only will it take a long time, but all the 
data being sent will increase the bandwidth used on the long distance data link 
tremendously. If the system does not have a good amount of extra bandwidth, and, 
10 typically, conventional systems do not, then this resynchronization would impact all 
other processing and disaster recovery mirroring currently happening in the system as 
well Furthermore, in most cases of lost synchronization, very little of the data, rf any, is 
actually out of synchronization. As a result, the entire volume of data will be recopied 
when only a few portions of data are actually not identical. 
15 [0008] Thus, there is a need for improved data synchronization. 

ST TMMARY OF T HF. TNVENTION 
[0009] Provided aie a method, system, and program for data synchronization. A first 
identifier is determined for a portion of data at a first source. A second identifier is 
20 determined for a portion of corresponding data at a second source. The first and second 
identifiers are compared. When the first and second identifiers do not match, the portion 
of corresponding data at the second source is replaced with the portion of data at the first 



source. 

[0010] The described implementations of the invention provide a method, system, and 
25 program for determining whether or not first and second data sets (e.g., volumes) are 
synchronized and, if the first and second datasets are not synchronized, to identify which 
data needs to be synchronized. Thus, if only portions of the first and second data sets 
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need .0 be synchronized, implementations of the invention eopy only those portions from 
one data set to the other. 

ppTpc r>P grp. TPTTON OF THE DRAWINGS 
5 Referring now to the drawings in which like reference numbers represent 

corresponding parts throughout: 

FIG. 1 illustrates, in a block diagram, a computing environment in accordance 

with certain implementations of the invention. 

FIGs 2Aand2B illustrate logic for data synchronization implemented in at least 

10 one synchronization system in accordance with certain implementations of the invention. 
FIGs. 3 A and 3B illustrate, in block diagrams, unique identifier generation 
techniques in accordance with certain implementations of the invention. 

FIG. 4 illustrates, in a block diagram, a source space and a target space in 
accordance with certain implementations of the invention. 
15 FIG. 5 illustrates an architecture of a computer system that may be used in 

accordance with certain implementations of the invention. 

DETAILED DESCRIPTION 
[0011] In the following description, reference is made to the accompanying drawings 
20 which form a part hereof and which illustrate several implementations of the present 
invention. It is understood that other implementations may be utilized and structural and 
operational changes may be made without departing from the scope of the present 
invention. 

[0012] FIG. 1 illustrates, in a block diagram, a computing environment in accordance 
25 with some implementations of the invention. Primary control units 1 10, 130, and 150 
provide one or more host computers 1 16, 136, or 156, respectively, access to primary 
storage devices 1 14, 134, and 154, respectively. The primary storage devices may be, for 
example, primary DASD. One or more applications may be executing on each host 
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computer 1 16, 136, or 156 to update data on the primary storage devices. The primary 
control units 110, 130, and 150 also have primary caches 1 12, 132, and 152, respectively, 
in which updates to tracks in the primary storage devices 1 14, 134, and 154, respectively, 
are maintained until destaged to the primary storage devices 1 14, 134, and 154. Back-up 
5 copies of all or a subset of the volumes of the primary storage devices 1 14, 134, and 154 
may be maintained at secondary storage devices 124, 144, and 164, respectively. 
[0013] A primary site may include primary control units 1 10, 130, and 150, primary 
storage device 1 14, 134, and 154, and host computers 116, 136, and 156. A secondary 
site may include secondary control units 120, 140, and 160, and secondary storage 
10 devices 124, 144, and 164. The secondary control units 120, 140, and 160 also have 

secondary caches 122, 142, and 162. Operator computers (not shown) may be attached to 
the secondary control units 120, 140, and 160. The secondary storage devices maybe, 
for example, secondary DASD. 

[0014] In some implementations, a "volume pair' 1 comprises a volume in the primary 
15 storage device and the shadow of that volume in the secondary storage device that 

maintains a back-up copy of the data at the primary volume in the volume pair. In certain 
implementations, for each primary volume, there is a corresponding secondary volume 
with consistent contents; this is a volume pair. For example, updates to the primary 
storage device 1 14 are transferred to the secondary control unit 120 for storage in the 
20 secondary storage device 124. 

[0015] The primary control units 110, 130, and 150 may further include a non- volatile 
storage unit (not shown), such as a battery-backed up volatile memory, to maintain a 
non- volatile copy of data updates. In certain implementations of the invention, the 
primary control units 1 10, 130, and 150 and secondary control units 120, 140, and 160 
25 may be comprised of the IBM TotalStorage 2105 Enterprise Storage Server (2105 ESS), 
which is available from International Business Machines Corporation. 
[0016] Each primary control unit 1 10, 130, and 150 and secondary control unit 120, 140, 
and 160 includes a synchronization system 111, 121, 131, 141, 151, and 161 for 
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providing synchronization of data. In certain implementations of the invenoon, dre 
synchronization system 111, 12!, 13!, 141, ISl.and 161 comprises a data storage 
controller (i.e., a storage control nni.) and includes a nnique identifier generator. 
,00171 ,n certain implemen,ations,,„e primary comro. units 110, 130, 150 and secondary 
5 con.ro. units 120, 140, and 160 communicate via communication paths, such as drree. 
Ugh speed transmissron lines (e.g., an Enterprise System Connection (ESCON®) hnk). 
However, the commumcafion paths may he comprised of any other communicaUon 
means known h> the art, including network transmission lines, finer optic cables, etc. 
[00181 Updates made to primary storage devices 1 14, 134, and 154 are transferred to 
,„ secondary control units 120, 140, and 160. The transfer of updates may occur ,n any 
nranne, In certain implementahons, backup data may he uansferred from any pnmary 
control unit 1 10, 130, and 150 to any secondary control unit 120, 140, and 160. Each 
volume Horn each primary storage device corresponds to a volume on a secondary 
storage device. For example, a firs, volume on primary storage device 1 14 is backed up 
,5 ro a volume on secondary storage device 124, and a second volume on primary storage 
device 134 is backed up to a volume on secondary storage device 144. Incertam 
implementations, one or more volumes are not backed up. 

[00191 FIGs. 2A and 2B illustrate logic for data synchronization implemented m at least 
one synchronization system in accordance with certain implementations of me iuvenbon. 
20 For ease of reference, the terms -primal and "secondary" are used herem. 

[mplememations of the invention are applicable to synchronizing data between any two 
sources (e.g„ a firs, source and a second source). Conb-ol begins a, block 200 with a 
determination tha, ,. is time to synctaonize data. In certain implementations, the 
determination tha, i. is time «o synchronize data is made when an error occurs (e.g„ data 
25 is lost while being transferted from the primal control unit ha the secondary contml urn. 
or data is corrupted a. one control unit). In certain alternative implementations of the 
invention, me determination drat it is time .o synchronize data is made periodically. 
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[0020] In block 202, a next portion of primary data and corresponding secondary data is 
selected, starting with a first portion. Primary data refers to data on a primary storage 
device, and secondary data refers to data on a secondary storage device. In certam 
implementations of the invention, the portion of data is a track on a volume. In block 
5 204, whether all portions of data have been selected is determined. If so, processing is 
done, otherwise, processing continues to block 206. 

[0021] In block 206, a unique identifier is determined for the portion of primary data. In 
block 208, a unique identifier is determined for the corresponding portion of secondary 
data In certain implementations of the invention, the unique identifiers are hash keys. 
10 That is, a hash key is associated with each portion of data on each of the volumes. The 
length of data for which a hash key is created can vary, as can the length of the hash key. 
In certain implementations of the invention, there is one hash key created for each 
portion of data (e.g., each track in each volume), and the hash key is 512 bits in length. 
Thus, there is a hash key created for each portion of data on the primary volume and one 
15 for each portion of data on the corresponding secondary volume. 

[0022] In block 210, the unique identifiers for the portions of primary and corresponding 
secondary data are compared. In implementations of the inventions in which the unique 
identifiers are hash keys, the hash keys are compared to determine whether the portions 
of data are synchronized. In block 212, it is determined whether the unique identifiers 
20 match. If there is a match, processing continues to block 202, otherwise, processing 

continues to block 214. That is, for each portion of data (e.g., track) for which the unique 
identifiers (e.g., hash keys) match, implementations of the invention recognize that the 
portions of data (e.g., tracks) are synchronized (i.e., are identical). In block 214, the 
portion of primary data is resent from the primary volume to the secondary volume. 
25 Thus copying an entire volume for resynchronization of data is avoided. 

[0023] In certain implementations of the invention, the logic of FIGs. 2A and 2B occurs 
at a synchronization system at a secondary control unit. In such implementations, the 
secondary control unit selects portions of data for comparison (blocks 202, 204), requests 
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the unique identifier for the portion of primary data from the primary control umt (block 
206) determines the unique identifier for the portion of corresponding secondary data 
(block 208), and compares the unique identifiers (block 210). If the unique identifiers do 
not match (block 212), the secondary control unit requests the portion of primary data 
5 from the primary control unit (block 214). 

[0024] In certain alternative implementations, the logic of FIGs. 2A and 2B occurs at a 
synchronization system at a primary control unit, and data from a secondary control unit 
may be copied to the primary control unit. For example, the primary control unit selects 
portions of data for comparison (blocks 202, 204), requests the unique identifier for the 
1 0 portion of secondary data from the secondary control unit (block 206), determines the 
unique identifier for the portion of corresponding primary data (block 208), and compares 
the unique identifiers (block 210). If the unique identifiers do not match (block 212), the 
primary control unit sends the portion of primary data to the secondary control unit 
(block 214). 

15 [0025] In certain additional implementations, a separate synchronization system resides 
at a computer system other than the primary control unit or the secondary control unit. 
The separate synchronization system may also reside outside of the primary site and the 
secondary site. The separate synchronization system requests unique identifiers for a 
portion of data from the primary and secondary control units and directs the primary 

20 control unit to send the portion of data to the secondary control unit if the unique 
identifiers for the portion of data did not match. 

[0026] There are a number of variations for implementations of the invention. In certain 
implementations of the invention, whenever a portion of data is updated, a unique 
identifier for the portion of data is generated and stored (e.g., in the storage devices or in 
25 cache) for future use. In certain alternative implementations of the invention, the unique 
identifiers are generated when there is a concern about data integrity. At that time, the 
synchronization systems at both the primary and the secondary control units create 
unique identifiers for each portion of data of each volume for which there is a concern 
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about data integrity. Then, at least one of the synchronization systems compares the 
unique identifiers to determine whether synchronization is required. In yet other 
alternative implementations, unique identifiers are generated/updated periodically. In 
these implementations, whenever updates occur to a portion of data, it is noted that the 
5 unique identifier is no longer valid. When the synchronization system is idle or at 
predetermined intervals, the synchronization system updates the unique identifiers that 
are no longer valid. Also, when unique identifiers are requested, any unique identifiers 
that are not valid may be updated at the time of the request. 

(0027] Certain implementations of the invention, in which the unique identifiers are hash 
10 keys ensure that a match of two hash keys for a portion of data means that the two 
portions of data are identical. Furthermore, any hash key generation technique may be 
used with implementations of the invention, including, for example, the MD5 and Secure 
Hash Algorithm-1 (SHA-1) hash functions. MD5 is a well known one-way hash function 
that takes a portion of data and converts the portion of data into a fixed string of digits 
15 (which is also referred to as a message digest). SHA-1 is a well known hash function that 
takes a portion of data of less than 2" bits in length and produces a 160-bit message 
digest. In certain implementations of the invention, the probability of two non-identical 
portions of data producing the same hash key is lower than the probability of a 
non-detectable error on a hard disk drive (HDD) of a storage device or a non-detectable 

20 error on the data link. 

[0028] FIGs. 3A and 3B illustrate, in block diagrams, unique identifier generation 
techniques in accordance with certain implementations of the invention. In certain 
implementations of the invention, a single function (e.g., a hash function) is used to 
generate the unique identifiers (e.g., hash functions). Alternatively, as illustrated in 

25 FIGs 3Aand3B,twoormorefunction S areappliedtoaportionofdatasimultaneously 
to provide an even higher level of confidence that when the unique identifiers associated 
with the portions of data match, the portions of data are identical. For example, in block 
300, a first function is used to generate a first unique identifier (e.g., a 256 bit MD5 key). 
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Simultaneously with processing in block 300, in block 3 10, a second function is used to 
generate a second unique identifier (e.g., a 256 bit SHA-1 key). In block 320, the first 
and second unique identifiers are combined (e.g., concatenated) to form a combined 
unique identifier that is used in determining whether two portions of data are 
5 synchronized. The processing of blocks 300 and 3 1 0 may be performed in parallel or 
sequentially in any order. 

[0029] In FIG. 3B, in block 350, a first function is used to generate a first unique 
identifier (e.g., a 256 bit MD5 key) for a portion of data. In block 360, a second function 
is used to generate a second unique identifier (e.g., a 256 bit SHA-1 key) based on the 

10 first unique identifier. That is, the result of the first function, which is the first unique 
identifier, is input into the second function. The new unique identifier generated by the 
second function is used in determining whether two portions of data are synchronized. 
Any technique for generating unique identifiers for portions of data is within the scope of 
the invention, and the examples of FIGs. 3 A and 3B are not intended to limit the scope of 

1 5 the invention. 

[0030] FIG. 4 illustrates, in a block diagram, a source space 400 and a target space 450 in 
accordance with certain implementations of the invention. The source space ("S") 400 
includes a portion of data (e.g., a track) 410 of length N bits, and the target space ("T") 
450 includes a hash key 460 of K bits. Then, the source space 400 is comprised of 2 N 
20 possible combinations, and the target space 450 is comprised of 2 K possible 

combinations. A hash key function maps a combination from the source space 400 into 
the target space 450. 

[0031] Implementations of the invention are interested in knowing the probability that 
two identical hash keys are generated by different portions of data from the source space 
25 400. For ease of reference, the term "source values" will be used to refer to the 

individual portions of data in the source space 400. In certain implementations of the 
invention, an assumption is made that the hash key function maps source values from the 
source space 400 into the target space 450 uniformly. That is, the inverse hash image of 
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each element of the target space 450 is of "approximately" the same cardinality. This 
assumption is valid since the hash key generation techniques were created such that it is 
"computationally infeasible" to find two different portions of data that produce an 
identical hash key by "brute force" computational methods. In other words, the hash key 
5 function operates on source portions of data such that it is very difficult to find the exact 
combination of bits to change in the portion of data to have the function generate an 
identical hash key. Thus, the assumption that the hash key function maps source values 
from the source space 400 into the target space 450 uniformly is valid. 
[0032] With reference to uniformity, the size of the probability of collision is determined 

10 as follows. For ease of reference, the term "target value" is used herein to refer to a 
unique identifier (e.g., hash key). Since 2 N values are uniformly mapped into a target 
space 450 of 2 K , for any target value there will be 2 N /2 K , or 2 N FC collisions. The term 
collision is used herein to indicate a match. Also, a false match is said to occur when two 
different portions of data that are not identical have identical hash keys. So, for a 

1 5 uniform function, the probability of a false match (or a collision) is the total number of 
possible collisions, 2 N K - 1, divided by the total source space 400, 2 N . Therefore, this 
probability is (2 N_K - 1) / 2 N , which is approximately, for large N, 2 N_K / 2 N , which is 2" K . 
[0033] That is, certain implementations of the invention use a hash function ("h"), h: 
S->T, such that, for all hash keys ("t") in T, (max |h-l(t)|) / 2N«(probability of hard 

20 disk drive (HDD) undetectable failure rates). That is, the probability of two 

non- identical portions of data producing the same hash key is lower than the probability 
of a non-detectable error on a hard disk drive (HDD) of a storage device or a 
non-detectable error on the data link. 

[0034] Therefore, the probability of a collision is effectively only dependent on the 
25 number of bits of the hash key for a sufficiently large portion of source data. By making 
this hash key length 256 bits, the probability of collision is 2 256 or 10' 78 . For 512 bits, the 
probability is 10" 155 . These probability numbers are much, much lower than those that are 
inherent in a standard HDD or in the data transfer link between two data sites. Thus, it is 
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reasonable to say that the probability that the hash key function reports that two portions 
of data are identical when they are not is negligible compared to the probability that an 
undetectable errors indicator (e.g., bit) is written and read incorrectly from the HDD or a 
bit is corrupted while being transferred from the primary to the secondary storage control 
5 unit. That is, the probability of collision is 2 256 , and with the probabilities so low, even if 
the hash key functions are not exactly uniform, the probabilities are still much lower than 
the probability of a bit error on a hard disk drive. 

[0035] Thus, implementations of the invention reduce both total time to synchronize two 
volumes and the bandwidth necessary to do so (i.e., transferring a unique identifier, 
1 0 rather than a portion of data, reduces bandwidth requirements). The amount of time 

saved by this technique is dependent on various factors, such as the size of the portions of 
data and the number of portions of data, the bandwidth of the data link, and the hash key 
function chosen and its implementation. 

[0036] This invention reduces the amount of traffic that will flow over the data link by a 
15 factor depending on the exact configuration of the system. As an example, if the source 
data is a portion of data of 32 kilobytes (KB) and the hash key is 512 bits, or 64 bytes 
(B), the reduction in traffic on the data link is 99.8%. Of course, if any portions of data 
are found to be out of synchronization, the entire portion of data will be copied. Over the 
entire volume, however, there typically are few portions of data in need of transfer and 
20 this represents a large bandwidth savings. 

[0037] In conclusion, implementations of the invention determine whether or not two 
storage mediums (e.g., volumes) are synchronized and, if not, identify data stored in the 
two storage media that are to be synchronized. In particular, implementations of the 
invention associate an identifier with each portion of data on a primary storage device 
25 and on a secondary storage device. If the identifier of a portion of primary data matches 
the identifier of a corresponding portion of secondary data, then the portions of data are 
determined to be synchronized. If the identifier of a portion of primary data does not 
match the identifier of a corresponding portion of secondary data, then the portions of 
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data are determined to require synchronization, and the portion of primary data is copied 
over the portion of secondary data. 

Additional Implementation Details 
5 [0038] The described techniques for data synchronization may be implemented as a 
method, apparatus or article of manufacture using standard programming and/or 
engineering techniques to produce software, firmware, hardware, or any combination 
thereof. The term "article of manufacture" as used herein refers to code or logic 
implemented in hardware logic (e.g., an integrated circuit chip, Programmable Gate 

10 Array (PGA), Application Specific Integrated Circuit (ASIC), etc.) or a computer 

readable medium, such as magnetic storage medium (e.g., hard disk drives, floppy disks,, 
tape, etc.), optical storage (CD-ROMs, optical disks, etc.), volatile and non- volatile 
memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, firmware, 
programmable logic, etc.). Code in the computer readable medium is accessed and 

15 executed by a processor. The code in which preferred embodiments are implemented may 
further be accessible through a transmission media or from a file server over a network. 
In such cases, the article of manufacture in which the code is implemented may comprise 
a transmission media, such as a network transmission line, wireless transmission media, 
signals propagating through space, radio waves, infrared signals, etc. Thus, the "article of 

20 manufacture" may comprise the medium in which the code is embodied. Additionally, 
the "article of manufacture" may comprise a combination of hardware and software 
components in which the code is embodied, processed, and executed. Of course, those 
skilled in the art will recognize that many modifications may be made to this 
configuration without departing from the scope of the present invention, and that the 

25 article of manufacture may comprise any information bearing medium known in the art. 
[0039] The logic of FIGs. 2 A, 2B, 3 A, and 3B describes specific operations occurring in 
a particular order. In alternative implementations, certain of the logic operations may be 
performed in a different order, modified or removed. Moreover, operations may be added 
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to the above described logic and still conform to the described implementations. Further, 
operations described herein may occur sequentially or certain operations may be 
processed in parallel, or operations described as performed by a single process may be 
performed by distributed processes. 
5 [0040] The illustrated logic of FIGs. FIGs. 2 A, 2B, 3 A, and 3B may be implemented in 
software, hardware, programmable and non-programmable gate array logic or in some 
combination of hardware, software, or gate array logic. 

[0041] FIG. 5 illustrates an architecture of a computer system that may be used in 
accordance with certain implementations of the invention. The primary control units, 

10 secondary control units, and/or hosts may implement computer architecture 500. The 
computer architecture 500 may implement a processor 502 (e.g., a microprocessor), a 
memory 504 (e.g., a volatile memory device), and storage 510 (e.g., a non- volatile 
storage area, such as magnetic disk drives, optical disk drives, a tape drive, etc.). An 
operating system 505 may execute in memory 504. The storage 510 may comprise an 

15 internal storage device or an attached or network accessible storage. Computer programs 
506 in storage 510 may be loaded into the memory 504 and executed by the processor 
502 in a manner known in the art. The architecture further includes a network card 508 
to enable communication with a network. An input device 5 1 2 is used to provide user 
input to the processor 502, and may include a keyboard, mouse, pen-stylus, microphone, 

20 touch sensitive display screen, or any other activation or input mechanism known in the 
art. An output device 514 is capable of rendering information transmitted from the 
processor 502, or other component, such as a display monitor, printer, storage, etc. The 
computer architecture 500 of the computer systems may include fewer components than 
illustrated, additional components not illustrated herein, or some combination of the 

25 components illustrated and additional components. 

[0042] The computer architecture 500 may comprise any computing device known in the 
art, such as a mainframe, server, personal computer, workstation, laptop, handheld 
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computer, telephony device, network appliance, virtualization device, storage controller, 
etc. Any processor 502 and operating system 505 known in the art may be used. 
[0043] The foregoing description of implementations of the invention has been presented 
for the purposes of illustration and description. It is not intended to be exhaustive or to 
5 limit the invention to the precise form disclosed. Many modifications and variations are 
possible in light of the above teaching. It is intended that the scope of the invention be 
limited not by this detailed description, but rather by the claims appended hereto. The 
above specification, examples and data provide a complete description of the 
manufacture and use of the composition of the invention. Since many implementations 
10 of the invention can be made without departing from the spirit and scope of the invention, 
the invention resides in the claims hereinafter appended. 



